SEQUENCE LISTING 

All gene sequences are of 552 base pairs of linear single stranded DNA. 

5 All protein sequences are of 181 amino acids in length and have enzymatic properties. It 
should be noted that the sequences presented here are identical to the published lip A 
sequence except for the first amino acid , methinine present in our sequence. The 
sequence purified and crysallized in literature does not have the N-terminal methionine. 

1 0 GENERAL INFORMATION 

APPLICANT: COUNCIL OF SCIENTIFIC AND INDUSTRAIL RESEARCH 
TITLE: STABLE GENE VARIANTS OF LIPASES 
NUMBER OF SEQUENCES 14 
1 5 CORRESPONDING ADDRESS: Centre for Cellular and Molecular Biology, Uppal 
Road, Hyderbad-500 007 .INDIA 

Protein Seq ID No. 1 
INFORMATION FOR SEQ ID NO: 1 
20 (i) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: enzyme 

25 (iii) SEQUENCE DESCRIPTION: SEQ ID NO:l 

MAEHNPVVMVHG1GGASFNFAGIKSYLVSQGWSRDKLYAVDFWDKTGTNYNN 
GPVLSRFVQKVLDETGAKKVDIVAHSMGGANTLYYIKNLDGGNKVANVVTLGG 
ANRLTTGKALPGTDPNQKILYTSIYSSADMIVMNYLSRLDGARNVQIHGGHIGLL 
30 YSSQVNSLIKEGLNGGGQNTN 

CORRESPONDING GENE SEQUENCE OF PROTEIN SEQ ID NO:l 

- (i) SEQUENCE CHARACTERISTICS: 
#pairs (A) LENGTH: 552 

35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: DNA 

- (iii) HYPOTHETICAL: NO 

40 

atggctgaacaca atccagtcgt tatggttcacggtattggag gggcatcatt caattttgcg ggaattaaga gctatctcgt 
atctcagggctggtcgcggg acaagctgta tgcagttgat ttttgggaca agacaggcac aaattataacaatggaccgg 
tattatcacg atttgtgcaa aaggttttag atgaaacggg tgcgaaaaaagtggatattg tcgctcacag catggggggc 
gcgaacacac tttactacat aaaaaatctggacggcggaa ataaagttgc aaacgtcgtg acggttggcg gcgcgaaccg 
45 utgacgacaggcaaggcgc ttccgggaac agatccaaat caaaagattt tatacacatc catttacagc agtgccgata 
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tgattgtcat gaattactta tcaagattag atggtgctag aaacgttcaaatccatggcg ttggacacat cggccttctg 
tacagcagcc aagtcaacag cctgattaaagaagggctga acggcggggg ccagaatacg aattaatga 
ORGANISM: Bacillus subtilis 

IMMEDIATE: NATURAL 

5 NAME/KEY: Natural 

SEQUENCE ID No. 1 



Protein Seq ID No. 2 
1 0 INFORMATION FOR SEQ ID NO: 1 

- (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 181 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
15 - (ii) MOLECULE TYPE: enzyme 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO:2 



MAEHNPVVMVHGIGGASFNFAGIKSYLVSQGWSRDKLYAVDFWDKTGTNYNN 
GPVLSRFVQKVLDETGAKKVDIVAHSMGGANTLYYIKNLDGGNKVANVVTLGG 
20 ANRLTTGKALPGTDPNQKILYTSIYSSADMIVMNYLSRLDGARNVQIHGGHIGLL 
YSSQVYSLIKEGLNGGGQNTN 

CORRESPONDING GENE SEQUENCE OF PROTEIN SEQ ID NO:2 

- (i) SEQUENCE CHARACTERISTICS: 
#pairs (A) LENGTH: 552 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: DNA 

- (iii) HYPOTHETICAL: NO 

atggctgaacaca atccagtcgt tatggttcacggtattggag gggcatcatt caattttgcg ggaattaaga gctatctcgt 
atctcagggctggtcgcggg acaagctgta tgcagttgat ttttgggaca agacaggcac aaattataacaatggaccgg 
lattatcacg atttgtgcaa aaggttttag atgaaacggg tgcgaaaaaagtggatattg tcgctcacag catggggggc 
35 gcgaacacac tttactacat aaaaaatctggacggcggaa ataaagttgc aaacgtcgtg acggttggcg gcgcgaaccg 
tttgacgacaggcaaggcgc ttccgggaac agatccaaat caaaagattt tatacacatc catttacagc agtgccgata 
tgattgtcat gaattactta tcaagattag atggtgctag aaacgttcaaatccatggcg ttggacacat cggccttctg 
tacagcagcc aagtctacag cctgattaaagaagggctga acggcggggg ccagaatacg aattaatga 
ORGANISM: Artifical 

40 IMMEDIATE: Aritifical 

NAME/KEY: Aritifical 

SEQUENCE ID No.2 



25 



30 
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Protein Seq ID No.3 
INFORMATION FOR SEQ ID NO:3 

- (i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: enzyme 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO:3 
1 0 MAEHNPVVMVHGIGGASFNFAGIKSYLVSQGWSRDKLYAVDFWDKTGTNYNN 
GPVLSRFVQKVLDETGVKXVDIVAHSMGGANTLYYIKNLDGGNKVANVVTLGG 
ANRLTTGKALPGTDPNQKILYTSIYSSDDMIVMNYLSRLDGARNVQIHGGHIGLL 
YSSQVV'SLIKEGLNGGGQNTN 

1 5 CORRESPONDING GENE SEQUENCE OF PROTEIN SEQ ID NO:3 

- (i) SEQUENCE CHARACTERISTICS: 
#pairs (A) LENGTH: 552 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: DNA 
(iii) HYPOTHETICAL: NO 

atggctgaacaca atccagtcgt tatggttcacggtattggag gggcatcatt caattttgcg ggaattaaga gctatctcgt 
25 atctcagggctggtcgcggg acaagctgta tgcagttgat ttttgggaca agacaggcac aaattataacaatggaccgg 
lattatcacg atttgtgcaa aaggttttag atgaaacggg tgtgaaaaaagtggatattg tcgctcacag catggggggc 
gcgaacacac tttactacat aaaaaatctggacggcggaa ataaagttgc aaacgtcgtg acggttggcg gcgcgaaccg 
tttgacgacaggcaaggcgc ttccgggaac agatccaaat caaaagattt tatacacatc catttacagc agtgacgata 
tgattgtcat gaattactta tcaagattag atggtgctag aaacgttcaaatccatggcg ttggacacat cggccttctg 
30 tacagcagcc aagtctacag cctgattaaagaagggctga acggcggggg ccagaatacg aattaatga 
ORGANISM: Artifical 

IMMEDIATE: Aritifical 

NAME/KEY: Aritifical 

SEQUENCE ID No.3 

35 Protein Seq ID No.4 

INFORMATION FOR SEQ ID NO:4 

- (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: enzyme 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO:4 

MAEHNPVVMVHG1GGASFNFAGIKSYLVSQGWSRDKLYAVDFWDKTGTNYNN 
GPVLSRFVQKVLDETGTKKVDIVAHSMGGANTLYYIKNLDGGNKVANVVTLGG 
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ANRLTTGKAPPGTDPNQKILYTSIYSSADMIVMNYLSRLDGARNVQIHGGHIGLL 
Y S S Q V Y S LIKEG LNG GGQNTN 

CORRESPONDING GENE SEQUENCE OF PROTEIN SEQ ID NO:4 
5 - (i) SEQUENCE CHARACTERISTICS: 
#pairs (A) LENGTH: 552 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 - (ii) MOLECULE TYPE: DNA 

- (iii) HYPOTHETICAL: NO 

atggctgaacaca atccagtcgt tatggttcacggtattggag gggcatcatt caattttgcg ggaattaaga gctatctcgt 
atctcagggctggtcgcggg acaagctgta tgcagttgat ttttgggaca agacaggcac aaattataacaatggaccgg 
tattatcacg atttgtgcaa aaggttttag atgaaacggg tacgaaaaaagtggatattg tcgctcacag catggggggc 
1 5 gcgaacacac tttactacat aaaaaatctggacggcggaa ataaagttgc aaacgtcgtg acggttggcg gcgcgaaccg 
tttgacgacaggcaaggcgcct ccgggaac agatccaaat caaaagattt tatacacatc catttacagc agtgccgata 
tgattgtcat gaattactta tcaagattag atggtgctag aaacgttcaaatccatggcg ttggacacat cggccttctg 
tacagcagcc aagtctacag cctgattaaagaagggctga acggcggggg ccagaatacg aattaatga 
ORGANISM: Artifical 

20 IMMEDIATE: Aritifical 

NAME/KEY: Aritifical 

SEQUENCE ID No.4 

Protein Seq ID No.5 
INFORMATION FOR SEQ ID NO:5 
25 - (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 181 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: enzyme 

30 (iii) SEQUENCE DESCRIPTION: SEQ ID NO:5 

MAEHNPVVMVHGIGGASFNFAGIKSYLVSQGWSRDKLYAVDFWDKTGTNYNN 
GPVLSRFVQKVLDETGAKKVDIVAHSMGGANTLYYIKNLDGGNKVANVVTLGG 
ANRLTTGKALPGTDPNQKILYTSIYSSDDMIVMNYLSRLDGARNVQIHGGHIGLL 
YSSQVYSLIKEGLNGGGQNTN 

35 

CORRESPONDING GENE SEQUENCE OF PROTEIN SEQ ID NO:5 

- (i) SEQUENCE CHARACTERISTICS: 
#pairs (A) LENGTH: 552 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: DNA 

- (iii) HYPOTHETICAL: NO 
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atggctgaacaca atccagtcgt tatggttcacggtattggag gggcatcatt caattttgcg ggaattaaga gctatctcgt 
atctcagggctggtcgcggg acaagctgta tgcagttgat ttttgggaca agacaggcac aaattataacaatggaccgg 
tattatcacg atttgtgcaa aaggttttag atgaaacggg tgcgaaaaaagtggatattg tcgctcacag catggggggc 
gcgaacacac tttactacat aaaaaatctggacggcggaa ataaagttgc aaacgtcgtg acggttggcg gcgcgaaccg 
5 tttgacgacaggcaaggcgc ttccgggaac agatccaaat caaaagattt tatacacatc catttacagc agtgacgata 
tgattgtcat gaattactta tcaagattag atggtgctag aaacgttcaaatccatggcg ttggacacat cggccttctg 
tacagcagcc aagtctacag cctgattaaagaagggctga acggcggggg ccagaatacg aattaatga 
ORGANISM: Artifical 

IMMEDIATE: Aritifical 

10 NAME/KEY: Aritifical 

SEQUENCE ID No.5 

Protein Seq ID No.6 
INFORMATION FOR SEQ ID NO: 6 
15 - (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: enzyme 

20 (iii) SEQUENCE DESCRIPTION: SEQ ID NO:6 

MAEHNPVVMVHGIGGASFNFAGIKSYLVSQGWSRDKLYAVDFWDKTGTNYNN 
GPVLSRFVQKVLDETGAKKVDIVAHSMGGANTLYYIKNLDGGNKVANVVTLGG 
ANRLTTGKAFPGTDPNQKILYTSIYSSDDMIVMNYLSRLDGARNVQ1HGGHIGLL 
Y S S Q V Y S LIKEGLNGG GQNTN 

25 

CORRESPONDING GENE SEQUENCE OF PROTEIN SEQ ID NO:6 

- (i) SEQUENCE CHARACTERISTICS: 
#pairs (A) LENGTH: 552 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: DNA 

- (iii) HYPOTHETICAL: NO 

atggctgaacaca atccagtcgt tatggttcacggtattggag gggcatcatt caattttgcg ggaattaaga gctatctcgt 
35 atctcagggctggtcgcggg acaagctgta tgcagttgat ttttgggaca agacaggcac aaattataacaatggaccgg 
tattatcacg atttgtgcaa aaggttttag atgaaacggg tgcgaaaaaagtggatattg tcgctcacag catggggggc 
gcgaacacac tttactacat aaaaaatctggacggcggaa ataaagttgc aaacgtcgtg acggttggcg gcgcgaaccg 
tttgacgacaggcaaggcgcccf ccgggaac agatccaaat caaaagattt tatacacatc catttacagc agtgacgata 
tgattgtcat gaattactta tcaagattag atggtgctag aaacgttcaaatccatggcg ttggacacat cggccttctg 
40 tacagcagcc aagtctacag cctgattaaagaagggctga acggcggggg ccagaatacg aattaatga 
ORGANISM: Artifical 

IMMEDIATE: Aritifical 

NAME/KEY: Aritifical 
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SEQUENCE ID No.6 

Protein Seq ID No.7 
INFORMATION FOR SEQ ID NO:7 

- (i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: enzyme 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO:7 

10 

MAEHNPVVMVHGIGGASFNFAGIKSYLVSQGWSRDKLYAVDFWDKTGTNYNN 
GPVLSRFVQKVLDETGAKKVDIVAHSMGGANTLYYIKNLDGGNKVANVVTLGG 
ANRLTTGKALPGTDPNQKILYTSIYSSADMIVMNYLSRLDGASNVQIHGGHIGLL 
YSSQVYSLIKEGLNGGGQNTN 

15 

CORRESPONDING GENE SEQUENCE OF PROTEIN SEQ ID NO:7 

- (i) SEQUENCE CHARACTERISTICS: 
#pairs (A) LENGTH: 552 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear . . 

- (ii) MOLECULE TYPE: DNA 

- (iii) HYPOTHETICAL: NO 

atggctgaacaca atccagtcgt tatggttcacggtattggag gggcatcatt caattttgcg ggaattaaga gctatctcgt 
25 atctcagggctggtcgcggg acaagctgta tgcagttgat ttttgggaca agacaggcac aaattataacaatggaccgg 
tattatcacg atttgtgcaa aaggttttag atgaaacggg tgcgaaaaaagtggatattg tcgctcacag catggggggc 
gcgaacacac tttactacat aaaaaatctggacggcggaa ataaagttgc aaacgtcgtg acggttggcg gcgcgaaccg 
tttgacgacaggcaaggcgc ttccgggaac agatccaaat caaaagattt tatacacatc catttacagc agtgccgata 
tgattgtcat gaattactta tcaagattag atggtgctagtaacgttcaaatccatggcg ttggacacat cggccttctg 
30 tacagcagcc aagtctacag cctgattaaagaagggctga acggcggggg ccagaatacg aattaatga 
ORGANISM: Artifical 

IMMEDIATE: Aritifical 

NAME/KEY: Aritifical 

SEQUENCE ID No.7 

35 

Protein Seq ID No. 8 
INFORMATION FOR SEQ ID NO:8 

- (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 181 amino acids 

40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: enzyme 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO:8 
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MAEHNPVVMVHG1GGASFNFAGIKSYLVSQGWSRDKLYAVDFWDKTGTNYNN 
GPVLSRFVQKVLDETGAKKVDIVAHSMGGANTLYYIKNLDGGNKVANVVTLGG 
ANRLTTGKAPPGTDPNQKILYTSIYSSADMIVMNYLSRLDGARNVQIHGGHIGLL 
YSSQVYSLIKEGLNGGGQNTN 

5 

CORRESPONDING GENE SEQUENCE OF PROTEIN SEQ ID NO:8 

- (i) SEQUENCE CHARACTERISTICS: 
#pairs (A) LENGTH: 552 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: DNA 

- (iii) HYPOTHETICAL: NO 

atggctgaacaca atccagtcgt tatggttcacggtattggag gggcatcatt caattttgcg ggaattaaga gctatctcgt 
}5 atctcagggctggtcgcggg acaagctgta tgcagttgat ttttgggaca agacaggcac aaattataacaatggaccgg 
tattatcacg atttgtgcaa aaggttttag atgaaacggg tgcgaaaaaagtggatattg tcgctcacag catggggggc 
gcgaacacac tttactacat aaaaaatctggacggcggaa ataaagttgc aaacgtcgtg acggttggcg gcgcgaaccg 
tttgacgacaggcaaggcgccct ccgggaac agatccaaat caaaagattt tatacacatc catttacagc agtgccgata 
tgattgtcat gaattactta tcaagattag atggtgctag aaacgttcaaatccatggcg ttggacacat cggccttctg 
20 tacagcagcc aagtctacag cctgattaaagaagggctga acggcggggg ccagaatacg aattaatga 
ORGANISM: Artifical 

IMMEDIATE: Aritifical 

NAME/KEY: Aritifical 

SEQUENCE ID No.8 

25 Protein Seq ID No.9 

INFORMATION FOR SEQ ID NO:9 

- (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: enzyme 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO:9 

MAEHNPVVMVHGIGGASFNFAGIKSYLVSQGWSRDKLYAVDFWDKTGTNYNN 
GPVLSRFVQKVLDETGAKKADIVAHSMGGANTLYYIKNLDGGNKVANVVTLGG 
35 ANRLTTGKALPGTDPNQKILYTSIYSSADMIVMNYLSRLDGARNVQIHGGHIGLL 
YSSQVYSLIKEGLNGGGQNTN 

CORRESPONDING GENE SEQUENCE OF PROTEIN SEQ ID NO:9 

- (i) SEQUENCE CHARACTERISTICS: 
40 #pairs (A) LENGTH: 552 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: DNA 
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- (iii) HYPOTHETICAL: NO 

atggctgaacaca atccagtcgt tatggttcacggtattggag gggcatcatt caattttgcg ggaattaaga gctatctcgt 
atctcagggctggtcgcggg acaagctgta tgcagttgat ttttgggaca agacaggcac aaattataacaatggaccgg 
tattatcacg atttgtgcaa aaggttttag atgaaacggg tgcgaaaaaagcggatattg tcgctcacag catggggggc 
5 gcgaacacac tttactacat aaaaaatctggacggcggaa ataaagttgc aaacgtcgtg acggttggcg gcgcgaaccg 
tttgacgacaggcaaggcgc ttccgggaac agatccaaat caaaagattt tatacacatc catttacagc agtgccgata 
tgattgtcat gaattactta tcaagattag atggtgctag aaacgttcaaatccatggcg ttggacacat cggccttctg 
lacagcagcc aagtctacag cctgattaaagaagggctga acggcggggg ccagaatacg aattaatga 
ORGANISM: Artifical 

1 0 IMMEDIATE: Aritifical 

NAME/KEY: Aritifical 

SEQUENCE ID No.9 

Protein Seq ID No. 10 
1 5 INFORMATION FOR SEQ ID NO: 1 0 

- (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 - (ii) MOLECULE TYPE: enzyme 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 1 0 

MAEHNPVVMVHG1GGASFNFAGIKSYLVSQGWSRDKLYAVDFWDKTGTNYNN 
GPVLSRFVQKVLDETGAKKVDIVAHSMGGANTLYYIKNLDGGNKVANVVTLGG 
ANRLTTGKALPGTDPNQKILYTSIYSSADMIVMNYLSRLVGARNVQIHGGHIGLL 
25 YSSQVYSLIKEGLNGGGQNTN 

CORRESPONDING GENE SEQUENCE OF PROTEIN SEQ ID NO: 1 0 

- (i) SEQUENCE CHARACTERISTICS: 
#pairs (A) LENGTH: 552 

30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: DNA 

- (iii) HYPOTHETICAL: NO 

35 atggctgaacaca atccagtcgt tatggttcacggtattggag gggcatcatt caattttgcg ggaattaaga gctatctcgt 
atctcagggctggtcgcggg acaagctgta tgcagttgat ttttgggaca agacaggcac aaattataacaatggaccgg 
tattatcacg atttgtgcaa aaggttttag atgaaacggg tgcgaaaaaagtggatattg tcgctcacag catggggggc 
gcgaacacac tttactacat aaaaaatctggacggcggaa ataaagttgc aaacgtcgtg acggttggcg gcgcgaaccg 
tttgacgacaggcaaggcgc ttccgggaac agatccaaat caaaagattt tatacacatc catttacagc agtgccgata 

40 tgattgtcat gaattactta tcaagatta gttggtgctag aaacgttcaaatccatggcg ttggacacat cggccttctg 
tacagcagcc aagtctacag cctgattaaagaagggctga acggcggggg ccagaatacg aattaatga 
ORGANISM: Artifical 

IMMEDIATE: Aritifical 
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NAME/KEY: Aritifical 



SEQUENCE ID No. 10 

ProteinSeqIDNo.il 
INFORMATION FOR SEQ ID NO: 1 1 
5 - (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: enzyme 

10 (iii) SEQUENCE DESCRIPTION: SEQ ID NO: 1 1 

MAEHNPVVMVHGIGGASFNFAGIKSYLVSQGWSRDKLYAVDFWDKTGTNYNN 
GPVLSRFVQKVLDETGVKKVDIVAHSMGGANTLYYIKNLDGGNKVANVVTLGG 
ANRLTTGKALPGTDPDQKILYTSIYSSDDMIVMNYLSRLDGARNVQIHGGHIGLL 
15 YSSQVYSLIKEGLNGGGQNTN 

CORRESPONDING GENE SEQUENCE OF PROTEIN SEQ ID NO:l 1 

- (i) SEQUENCE CHARACTERISTICS: 
#pairs (A) LENGTH: 552 

20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: DNA 

- (iii) HYPOTHETICAL: NO 

25 atggctgaacaca atccagtcgt tatggttcacggtattggag gggcatcatt caattttgcg ggaattaaga gctatctcgt 
atctcagggctggtcgcggg acaagctgta tgcagttgat ttttgggaca agacaggcac aaattataacaatggaccgg 
tattatcacg atttgtgcaa aaggttttag atgaaacggg tgtgaaaaaagtggatattg tcgctcacag catggggggc 
gcgaacacac tttactacat aaaaaatctggacggcggaa ataaagttgc aaacgtcgtg acggttggcg gcgcgaaccg 
tttgacgacaggcaaggcgc ttccgggaac agatccagat caaaagattt tatacacatc catttacagc agtgacgata 

30 tgattgtcat gaattactta tcaagattag atggtgctag aaacgttcaaatccatggcg ttggacacat cggccttctg 
tacagcagcc aagtctacag cctgattaaagaagggctga acggcggggg ccagaatacg aattaatga 
ORGANISM: Artifical 

IMMEDIATE: Aritifical 

NAME/KEY: Aritifical 

35 SEQUENCEIDNo.il 

Protein Seq ID No. 12 
INFORMATION FOR SEQ ID NO: 12 

- (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 181 amino acids 

40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: enzyme 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
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MAEHNPVVMVHGIGGASFNFAGIKSYLVSQGWSRDKLYAVDFWDKTGTNYNN 
GPVLSRFVQKVLDETGAKKVDIVAHSMGGANTLYYIKNLDGGNKVANVVTLGG 
ANRLTTGKAPPGTDPDQKILYTSIYSSDDMIVMNYLSRLDGARNVQIHGGHIGLL 
YSSQVYSLIKEGLNGGGQNTN 

5 

CORRESPONDING GENE SEQUENCE OF PROTEIN SEQ ID NO: 1 2 

- (i) SEQUENCE CHARACTERISTICS: 
#pairs (A) LENGTH: 552 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: DNA 

- (iii) HYPOTHETICAL: NO 

atggctgaacaca atccagtcgt tatggttcacggtattggag gggcatcatt caattttgcg ggaattaaga gctatctcgt 
1 5 atctcagggctggtcgcggg acaagctgta tgcagttgat ttttgggaca agacaggcac aaattataacaatggaccgg 
tattatcacg atttgtgcaa aaggttttag atgaaacggg tgcgaaaaaagtggatattg tcgctcacag catggggggc 
gcgaacacac tttactacat aaaaaatctggacggcggaa ataaagttgc aaacgtcgtg acggttggcg gcgcgaaccg 
Mgacgacaggcaaggcgccct ccgggaac agatccagat caaaagattt tatacacatc catttacagc agtgacgata 
tgattgtcat gaattactta tcaagattag atggtgctag aaacgttcaaatccatggcg ttggacacat cggccttctg 
20 tacagcagcc aagtctacag cctgattaaagaagggctga acggcggggg ccagaatacg aattaatga 
ORGANISM: Artifical 

IMMEDIATE: Aritifical 

NAME/KEY: Aritifical 

SEQUENCE ID No. 12 

25 INFORMATION FOR SEQ ID NO: 1 3 

- (i) SEQUENCE CHARACTERISTICS: 
#pairs (A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: DNA 
(iii) HYPOTHETICAL: NO 

5 '-CGCCAGGGTTTTCCCAGTCACGAC-3 ' 
35 ORGANISM: Artifical 

IMMEDIATE: Aritifical 

NAME/KEY: Oligonucleotide 

SEQUENCE ID No. 13 

INFORMATION FOR SEQ ID NO: 14 
40 - (i) SEQUENCE CHARACTERISTICS: 
#pairs (A) LENGTH: 22 

(B) TYPE: nucleic acid 



37 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: DNA 
(iii) HYPOTHETICAL: NO 

5 

5'- TG AC AC AGG AAACAGCTATG AC-3 ' 
ORGANISM: Artifical 

IMMEDIATE: Aritifical 

NAME/KEY: Oligonucleotide 

1 0 SEQUENCE ID No. 1 4 
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